91 research outputs found

    NcDNAlign: Plausible multiple alignments of non-protein-coding genomic sequences

    Get PDF
    Genome-wide multiple sequence alignments (MSAs) are a necessary prerequisite for an increasingly diverse collection of comparative genomic approaches. Here we present a versatile method that generates high-quality MSAs for non-protein-coding sequences. The NcDNAlign pipeline combines pairwise BLAST alignments to create initial MSAs, which are then locally improved and trimmed. The program is optimized for speed and hence is particulary well-suited to pilot studies. We demonstrate the practical use of NcDNAlign in three case studies: the search for ncRNAs in gammaproteobacteria and the analysis of conserved noncoding DNA in nematodes and teleost fish, in the latter case focusing on the fate of duplicated ultra-conserved regions. Compared to the currently widely used genome-wide alignment program TBA, our program results in a 20- to 30-fold reduction of CPU time necessary to generate gammaproteobacterial alignments. A showcase application of bacterial ncRNA prediction based on alignments of both algorithms results in similar sensitivity, false discovery rates, and up to 100 putatively novel ncRNA structures. Similar findings hold for our application of NcDNAlign to the identification of ultra-conserved regions in nematodes and teleosts. Both approaches yield conserved sequences of unknown function, result in novel evolutionary insights into conservation patterns among these genomes, and manifest the benefits of an efficient and reliable genome-wide alignment package. The software is available under the GNU Public License at http://www.bioinf.uni-leipzig.de/Software/NcDNAlign/

    The Effect of RNA Secondary Structures on RNA-Ligand Binding and the Modifier RNA Mechanism: A Quantitative Model

    Get PDF
    RNA-ligand binding often depends crucially on the local RNA secondary structure at the binding site. We develop here a model that quantitatively predicts the effect of RNA secondary structure on effective RNA-ligand binding activities based on equilibrium thermodynamics and the explicit computations of partition functions for the RNA structures. A statistical test for the impact of a particular structural feature on the binding affinities follows directly from this approach. The formalism is extended to describing the effects of hybridizing small \modifier RNAs' to a target RNA molecule outside its ligand binding site. We illustrate the applicability of our approach by quantitatively describing the interaction of the mRNA stabilizing protein HuR with AU-rich elements [Meisner et al. (2004), Chem. Biochem. in press]. We discuss our model and recent experimental findings demonstrating the ffectivity of modifier RNAs in vitro in the context of the current research activities in the field of non-coding RNAs. We speculate that modifier RNAs might also exist in nature; if so, they present an additional regulatory layer for fine-tuning gene expression that could evolve rapidly, leaving no obvious traces in the genomic DNA sequences

    Computational RNomics of Drosophilids

    Get PDF
    Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set of comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz

    A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection

    Get PDF
    Numerous high-throughput sequencing studies have focused on detecting conventionally spliced mRNAs in RNA-seq data. However, non-standard RNAs arising through gene fusion, circularization or trans-splicing are often neglected. We introduce a novel, unbiased algorithm to detect splice junctions from single-end cDNA sequences. In contrast to other methods, our approach accommodates multi-junction structures. Our method compares favorably with competing tools for conventionally spliced mRNAs and, with a gain of up to 40% of recall, systematically outperforms them on reads with multiple splits, trans-splicing and circular products

    Thermodynamics of RNA-RNA binding

    Get PDF
    Background: Reliable prediction of RNA–RNA binding energies is crucial, e.g. for the understanding on RNAi, microRNA–mRNA binding and antisense interactions. The thermodynamics of such RNA–RNA interactions can be understood as the sum of two energy contributions: (1) the energy necessary to ‘open’ the binding site and (2) the energy gained from hybridization. Methods: We present an extension of the standard partition function approach to RNA secondary structures that computes the probabilities Pu[i, j] that a sequence interval [i, j] is unpaired. Results: Comparison with experimental data shows that Pu[i, j] can be applied as a significant determinant of local target site accessibility for RNA interference (RNAi). Furthermore, these quantities can be used to rigorously determine binding free energies of short oligomers to large mRNA targets. The resource consumption is comparable with a single partition function computation for the large target molecule. We can show that RNAi efficiency correlates well with the binding energies of siRNAs to their respective mRNA target

    Interleukin-6-dependent survival of multiple myeloma cells involves the Stat3-mediated induction of micro-RNA-21 through a highly conserved enhancer

    Get PDF
    Signal transducer and activator of transcription 3 (Stat3) is implicated in the pathogenesis of many malignancies and essential for IL-6–dependent survival and growth of multiple myeloma cells. Here, we demonstrate that the gene encoding oncogenic microRNA-21 (miR-21) is controlled by an upstream enhancer containing 2 Stat3 binding sites strictly conserved since the first observed evolutionary appearance of miR-21 and Stat3. MiR-21 induction by IL-6 was strictly Stat3 dependent. Ectopically raising miR-21 expression in myeloma cells in the absence of IL-6 significantly reduced their apoptosis levels. These data provide strong evidence that miR-21 induction contributes to the oncogenic potential of Stat3

    Reliability of human retina organoid generation from hiPSC-derived neuroepithelial cysts

    Get PDF
    The possible applications for human retinal organoids (HROs) derived from human induced pluripotent stem cells (hiPSC) rely on the robustness and transferability of the methodology for their generation. Standardized strategies and parameters to effectively assess, compare, and optimize organoid protocols are starting to be established, but are not yet complete. To advance this, we explored the efficiency and reliability of a differentiation method, called CYST protocol, that facilitates retina generation by forming neuroepithelial cysts from hiPSC clusters. Here, we tested seven different hiPSC lines which reproducibly generated HROs. Histological and ultrastructural analyses indicate that HRO differentiation and maturation are regulated. The different hiPSC lines appeared to be a larger source of variance than experimental rounds. Although previous reports have shown that HROs in several other protocols contain a rather low number of cones, HROs from the CYST protocol are consistently richer in cones and with a comparable ratio of cones, rods, and Müller glia. To provide further insight into HRO cell composition, we studied single cell RNA sequencing data and applied CaSTLe, a transfer learning approach. Additionally, we devised a potential strategy to systematically evaluate different organoid protocols side-by-side through parallel differentiation from the same hiPSC batches: In an explorative study, the CYST protocol was compared to a conceptually different protocol based on the formation of cell aggregates from single hiPSCs. Comparing four hiPSC lines showed that both protocols reproduced key characteristics of retinal epithelial structure and cell composition, but the CYST protocol provided a higher HRO yield. So far, our data suggest that CYST-derived HROs remained stable up to at least day 200, while single hiPSC-derived HROs showed spontaneous pathologic changes by day 200. Overall, our data provide insights into the efficiency, reproducibility, and stability of the CYST protocol for generating HROs, which will be useful for further optimizing organoid systems, as well as for basic and translational research applications

    Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures

    Get PDF
    With few exceptions, current methods for short read mapping make use of simple seed heuristics to speed up the search. Most of the underlying matching models neglect the necessity to allow not only mismatches, but also insertions and deletions. Current evaluations indicate, however, that very different error models apply to the novel high-throughput sequencing methods. While the most frequent error-type in Illumina reads are mismatches, reads produced by 454's GS FLX predominantly contain insertions and deletions (indels). Even though 454 sequencers are able to produce longer reads, the method is frequently applied to small RNA (miRNA and siRNA) sequencing. Fast and accurate matching in particular of short reads with diverse errors is therefore a pressing practical problem. We introduce a matching model for short reads that can, besides mismatches, also cope with indels. It addresses different error models. For example, it can handle the problem of leading and trailing contaminations caused by primers and poly-A tails in transcriptomics or the length-dependent increase of error rates. In these contexts, it thus simplifies the tedious and error-prone trimming step. For efficient searches, our method utilizes index structures in the form of enhanced suffix arrays. In a comparison with current methods for short read mapping, the presented approach shows significantly increased performance not only for 454 reads, but also for Illumina reads. Our approach is implemented in the software segemehl available at http://www.bioinf.uni-leipzig.de/Software/segemehl/

    The Role of lncRNAs TAPIR-1 and -2 as Diagnostic Markers and Potential Therapeutic Targets in Prostate Cancer

    Get PDF
    In search of new biomarkers suitable for the diagnosis and treatment of prostate cancer, genome-wide transcriptome sequencing was carried out with tissue specimens from 40 prostate cancer (PCa) and 8 benign prostate hyperplasia patients. We identified two intergenic long non-coding transcripts, located in close genomic proximity, which are highly expressed in PCa. Microarray studies on a larger cohort comprising 155 patients showed a profound diagnostic potential of these transcripts (AUC~0.94), which we designated as tumor associated prostate cancer increased lncRNA (TAPIR-1 and -2). To test their therapeutic potential, knockdown experiments with siRNA were carried out. The knockdown caused an increase in the p53/TP53 tumor suppressor protein level followed by downregulation of a large number of cell cycle- and DNA-damage repair key regulators. Furthermore, in radiation therapy resistant tumor cells, the knockdown leads to a renewed sensitization of these cells to radiation treatment. Accordingly, in a preclinical PCa xenograft model in mice, the systemic application of nanoparticles loaded with siRNA targeting TAPIR-1 significantly reduced tumor growth. These findings point to a crucial role of TAPIR-1 and -2 in PCa
    • …
    corecore